Abstract

Consider an invertible measure-preserving transformation of a probability 
space. A finite partition of the space is called weakly independent if there are 
infinitely many images of this partition under powers of the transformation 
that are jointly independent. Krengel proved that a transformation is weakly 
mixing if and only if weakly independent partitions of the underlying space 
are dense among all finite partitions. Using the tools developed in the later 
papers of del Junco-Reinhold-Weiss and del Junco-Begun we obtain Krengel-type
results for weakly mixing random dynamical systems (or equivalently,
skew products that are relatively weakly mixing).

0. Introduction

The notion of weak mixing is one of the strengthenings of the notion of 
ergodicity of a measure-preserving transformation. Along with the notions of 
mixing, K-property etc. it belongs to the hierarchy of statistical properties
of transformations in ergodic theory. This notion complements the notion of 
discrete spectrum and has several equivalent characterizations mostly going 
back to Koopman and von Neumann (early 1930's). In 1970 Krengel proved 
in [K] another unexpected characterization of weak mixing. In this paper we 
prove two analogues of Krengel's result for systems that are weakly mixing 
relative to a factor. Our claim can be alternatively stated in the language 
of random dynamical systems. Before presenting the original theorem and 
its relativization, we provide necessary definitions concerning partitions of 
probability spaces (a partition is a representation of the space as a finite 
disjoint union of measurable sets). 

A family of measurable sets in a probability space (X, m) is independent
if for every finite sub-family $A_1, A_2, \dots, A_k$ we have

$$m\Big(\bigcap_{i=1}^{k} A_i\Big) = \prod_{i=1}^{k} m(A_i).$$

A family of measurable partitions in X is independent if every family of sets
of the partitions - one set from each partition - is independent. If
$P = \{A_1, \dots, A_n\}$ is an ordered partition then dist P, the distribution of P, is a
vector in $\mathbb{R}^n$ defined by $(\operatorname{dist} P)_i = m(A_i)$. Consequently, if $P = \{A_1, \dots, A_n\}$
and $Q = \{B_1, \dots, B_n\}$ are two partitions with the same number of atoms, we
will say that P and Q have the same distribution if $m(A_i) = m(B_i)$ for all i.
The set of all (ordered) partitions into n atoms can be turned into a complete
metric space with the metric $d_m(P,Q) = \sum_{i=1}^{n} m(A_i \triangle B_i)$. Wherever there
is no ambiguity we will omit the subscript and denote this distance by plain
d.

By a (dynamical) system we mean a quadruple $\mathbf{X} = (X, \mathcal{B}, \mu, T)$ where
$(X, \mathcal{B}, \mu)$ is a probability space and T is an automorphism of $(X, \mathcal{B}, \mu)$, that
is T is an invertible $\mu$-preserving map. A sequence $\{n_i\} \subset \mathbb{Z}$ is called mixing
for T if

$$\mu(A \cap T^{n_i} B) \to \mu(A)\mu(B) \quad \text{as } i \to \infty.$$

One characterization of weak mixing of T is that the product system
$(X \times X, \mathcal{B} \times \mathcal{B}, \mu \times \mu, T \times T)$ is ergodic. An equivalent one is that there exists a
mixing sequence $\{n_i\} \subset \mathbb{N}$ for T having density one in $\mathbb{N}$ or equivalently there
exists any mixing sequence at all.

Theorem (Krengel). $(X, \mathcal{B}, \mu, T)$ is a weakly mixing system if and only if
for every finite measurable partition P of X and $\varepsilon > 0$, there is an infinite
sequence $0 = n_0 < n_1 < \dots$ and a partition P' with the same distribution as
P such that $d(P, P') < \varepsilon$ and the partitions $\{T^{-n_i} P'\}_{i=0}^{\infty}$ are jointly
independent.

Krengel's very technical proof was substantially simplified in 1999 by del 
Junco, Reinhold and Weiss ([JRW]), paving the way for generalizations and 
refinements. Among other things it was shown there that the sequence in 
Krengel's theorem can be chosen inside any prescribed mixing sequence for 
T. The key tool for this was an extension result for stationary measures on
$A^{\mathbb{Z}}$. In the subsequent paper [BJ] the extension theorem was generalized to
$A^G$, G any discrete amenable group, permitting the Krengel theorem to be
generalized to free actions of discrete amenable groups.

We restrict our attention to systems $(X, \mathcal{B}, \mu, T)$ such that X is a complete
metric space and $\mathcal{B}$ is the $\mu$-completion of the Borel $\sigma$-algebra of X. We
will call such systems standard. This assumption involves no loss of generality
for theorems of the kind we are concerned with here. We will be studying the
properties of $(X, \mathcal{B}, \mu, T)$ relative to a distinguished factor algebra. There
is no loss of generality in assuming that this factor algebra is given by $\pi^{-1}\mathcal{F}$ where

$$\pi : \mathbf{X} \to \mathbf{\Omega} = (\Omega, \mathcal{F}, \mathbb{P}, \theta)$$

is a factor map from $\mathbf{X}$ to another standard system $\mathbf{\Omega}$. The sets $\pi^{-1}\{\omega\}$, $\omega \in \Omega$,
are the fibers of $\pi$. The measure $\mu$ decomposes over the factor as

$$\mu = \int_{\Omega} \mu_\omega \, d\mathbb{P}(\omega),$$

where $\mu_\omega$ is supported on the fiber $\pi^{-1}\{\omega\}$.

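As is well-known, when the factor system is ergodic, we can represent $\mathbf{X}$
as a skew-product over $\mathbf{\Omega}$:

$$(X, \mathcal{B}, \mu) = (\Omega, \mathcal{F}, \mathbb{P}) \times (Y, \mathcal{C}, \nu),$$

$$T(\omega, y) = (\theta\omega, T_\omega y),$$

where $T_\omega$ is an automorphism of $(Y, \mathcal{C}, \nu)$ which depends measurably on $\omega$.
The pair $(\theta, \{T_\omega\})$ is called a random dynamical system.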

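From now on we identify $\pi^{-1}\mathcal{F}$ with $\mathcal{F}$ and write $\mathcal{F}$ for both. One definition
of relative weak mixing uses the relative product of $(X, \mathcal{B}, \mu, T) = \mathbf{X}$ with
itself. This is the system

$$\mathbf{X} \times_\Omega \mathbf{X} = (X \times X, \mathcal{B} \times \mathcal{B}, \lambda, T \times T)$$

where the relative product measure $\lambda = \mu \times_\Omega \mu$ is defined by

$$\lambda = \int \mu_\omega \times \mu_\omega \, d\mathbb{P}(\omega).$$

Note that $\lambda$ is supported on the fibered product

$$\bigcup\{\pi^{-1}\{\omega\} \times \pi^{-1}\{\omega\} : \omega \in \Omega\} \subset X \times X.$$

Consequently $\mathbf{\Omega}$ is a factor of $\mathbf{X} \times_\Omega \mathbf{X}$ in a canonical way.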

By definition $\mathbf{X}$ is weakly mixing relative to $\mathcal{F}$ if $\mathbf{X} \times_\Omega \mathbf{X}$ is ergodic relative
to $\mathcal{F}$. What this means is that any invariant function for the relative product
is $\mathcal{F}$-measurable. There is an equivalent characterization as a (relatively)
mixing condition: T has a relatively mixing sequence $\{n_i\}$, which means that
for any sets A and B

$$\mu_\omega(A \cap T^{-n_i}B) - \mu_\omega(A)\,\mu_\omega(T^{-n_i}B)$$

converges to 0 in measure (as a function of $\omega$). Equivalently, T has a relatively
mixing sequence of density one in $\mathbb{N}$. The existence of a relatively mixing
sequence of density one in the presence of relative weak mixing follows easily
from [F], Lemma 7.6. The converse fact, that a relatively mixing sequence
implies relative weak mixing, is easy since a relatively mixing sequence for
$\mathbf{X}$ is also a relatively mixing sequence for $\mathbf{X} \times_\Omega \mathbf{X}$ and the existence of a
relatively mixing sequence certainly forces relative ergodicity.

We will present two versions of Krengel's theorem for relatively weakly 
mixing systems. The first says that by a uniformly small perturbation of a 
given partition it is possible to obtain one with the same distribution on a.a. 
fibers so that it has infinitely many independent iterates on a set of fibers 
with probability arbitrarily close to 1. "Probability" here means the measure 
on the factor, in accordance with the ideology of RDS theory. Consequently,
"a.s." will mean "for $\mathbb{P}$-a.e. $\omega \in \Omega$".

For convenience we will state and prove the results in the case of ergodic
systems. Theorem 1 remains valid without this assumption and so does
Theorem 2, with a minor modification. The proofs are not much more difficult.
After proving the results in Section 3 we will make some comments about
how to remove the ergodicity assumption. We remark that when T is weakly
mixing relative to $\mathbf{\Omega}$, ergodicity of T is equivalent to ergodicity of $\theta$.

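Theorem 1. Let $(X, \mathcal{B}, \mu, T)$ be an ergodic system with a factor $(\Omega, \mathcal{F}, \mathbb{P}, \theta)$
where $\mu$ decomposes as $\mu = \int \mu_\omega \, d\mathbb{P}$. Assume $\{n_i\}$ is a relatively mixing
sequence for T, and P is a finite partition of X for which there is an $\alpha > 0$
such that

$$\forall p \in P : \ \mu_\omega(p) > \alpha > 0 \quad \text{for } \mathbb{P}\text{-a.e. } \omega \in \Omega.$$

Then for every $\varepsilon > 0$ there exist a partition Q of X, a set $E \subset \Omega$, $\mathbb{P}(E) < \varepsilon$,
and a subsequence $\{m_i\}$ of $\{n_i\}$, such that

$$\operatorname{dist}_{\mu_\omega} Q = \operatorname{dist}_{\mu_\omega} P \quad \text{for } \mathbb{P}\text{-a.e. } \omega, \qquad (1)$$

$$d_{\mu_\omega}(P, Q) < \varepsilon \quad \text{for } \mathbb{P}\text{-a.e. } \omega, \qquad (2)$$

$$\{T^{m_i} Q\} \text{ is jointly independent with respect to } \mu_\omega \quad \forall \omega \notin E. \qquad (3)$$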

The independence claim holds for a set of $\omega$ of measure arbitrarily close
to 1, not $\mathbb{P}$-almost everywhere. This is not a drawback of the method - if we
require that Q be uniformly close to P then we cannot obtain independence
on almost all fibers. For a counterexample consider the so-called $(S, S^{-1})$
transformation in its simplest version. Let $\Omega = Y = \{-1, 1\}^{\mathbb{Z}}$ with the standard
product measure $\mathbb{P}$ coming from the (1/2, 1/2) distribution of probabilities
on $\{-1, 1\}$, and let $\theta = S$ be the Bernoulli shift: $(Sx)_n = x_{n+1}$. The $(S, S^{-1})$
map is a skew product T on $\Omega \times Y$ defined by

$$T(\omega, y) = (S\omega, S^{\omega_0} y).$$

The family $\{S^{\omega_0}\}$ contains only two distinct transformations - the shifts to
the left and to the right. The system is weakly mixing with respect to the
base (it is even strongly mixing in the $L^1$ sense - see [R] for the definitions
and the proof), but the independence a.e. cannot be achieved even along a
set of two numbers, say $\{n_1, n_2\}$.

To see this first observe that if $T^{n_1}Q$ were $\mu_\omega$-independent of $T^{n_2}Q$ for
a.a. $\omega$ then, for $n = n_1 - n_2$, we would also have $T^nQ$ $\mu_\omega$-independent of Q
for a.a. $\omega$. Hence we may as well take $n_1 = 0$ and $n_2 = n$. Choose a partition
R of $Y = \Omega$ such that

$$\operatorname{dist}_{\mathbb{P}} R = (1/2, 1/2) \quad \text{and} \quad d_{\mathbb{P}}(R, SR) < \frac{1}{100}.$$

Example: viewing R as a map into the index set $\{-1, 1\}$ let
$R(y) = \operatorname{sign}\big(\sum_{i=1}^{2N+1} y_i\big)$ for a sufficiently large N.
(Note that this sum cannot be 0.) Define P on X by $P(\omega, y) = R(y)$. Observe
that

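$$T^n(\omega, y) = (\theta^n\omega, S^{\varphi(\omega,n)} y)$$

where

$$\varphi(\omega, n) = \sum_{i=0}^{n-1} \omega_i.$$

Now let $\delta = 0$ or 1 according to whether n is even or odd (the sum of n terms
$\pm 1$ has the parity of n). Then the set
$E = \{\omega : \varphi(\omega, n) = \delta\}$ has positive measure and for $\omega \in \theta^n E$ we will have

$$d_{\mu_\omega}(P, T^nP) = d_{\mathbb{P}}(R, S^{\delta}R) < \frac{1}{100}.$$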
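Now suppose Q is a partition of X such that for almost all $\omega$

$$\operatorname{dist}_{\mu_\omega}(Q) = (1/2, 1/2) \quad \text{and} \quad d_{\mu_\omega}(Q, P) < \frac{1}{100}.$$

Then it is easy to see that for $\omega \in \theta^n E$

$$d_{\mu_\omega}(Q, T^nQ) < \frac{3}{100}.$$

On the other hand if Q and $T^nQ$ were independent with respect to $\mu_\omega$ then
we would have $d_{\mu_\omega}(Q, T^nQ) = \frac{1}{2}$, a contradiction.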
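
The parity behaviour of the cocycle used in this counterexample can be checked by brute force. The following Python sketch is an illustration only: the helper names `phi` and `prob_phi_equals` are hypothetical, and it assumes the cocycle is the partial sum of the first n coordinates of the base point, so that the fair-coin measure reduces to counting words of length n over {-1, 1}. It indicates that the sum always has the parity of n and takes the smallest admissible value (0 or 1) with positive probability.

```python
from fractions import Fraction
from itertools import product


def phi(word):
    """Partial sum omega_0 + ... + omega_{n-1} of a word over {-1, 1}."""
    return sum(word)


def prob_phi_equals(n, value):
    """Exact probability that the sum of n fair +-1 coordinates equals value."""
    words = list(product((-1, 1), repeat=n))
    hits = sum(1 for w in words if phi(w) == value)
    return Fraction(hits, len(words))


if __name__ == "__main__":
    for n in (4, 5):
        delta = n % 2  # the partial sum always has the parity of n
        print(n, delta, prob_phi_equals(n, delta))
```

Only finitely many coordinates enter the sum, which is why exhaustive enumeration with exact fractions is enough here.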

In our second result, by dropping the requirement that the perturbation 
be uniformly small we are able to obtain the desired independence on almost 
all fibers, rather than just a large set of fibers. 

Theorem 2. With the same hypotheses as in Theorem 1 one can find a
partition Q and a subsequence $\{m_i\}$ of $\{n_i\}$ such that $\operatorname{dist}_{\mu_\omega}Q = \operatorname{dist}_{\mu_\omega}P$
for a.a. $\omega$, $d_\mu(P, Q) < \varepsilon$ and the partitions $\{T^{m_i}Q\}$ are jointly independent
with respect to $\mu_\omega$ for almost every $\omega$.

Remark. Of course to say that $d_\mu(P, Q)$ is small is just to say that $d_{\mu_\omega}(P, Q)$
is small for all but a small set of $\omega$.

Our presentation conforms to the following plan. After a section on
preliminaries and notation we prove in Section 2 a result (Proposition 1) on the
existence of measures with prescribed marginals - a non-stationary
generalization of the stationary extension result Theorem 2 in [BJ]. Its proof mimics
closely the proof of the stationary result. Theorems 1 and 2 are proved in
Section 3.

1. Notation and Preliminaries 

In this section we introduce some notation and definitions that pertain 
to Sections 2 and 3. In the beginning of Section 3 more preliminaries are 
collected that are specific for the proof of the main result. 

First we adopt a more formal definition of an ordered partition. A (finite)
partition of a measurable space $(X, \mathcal{B})$ is a measurable map P from X to a
finite index set A (so by definition we are dealing with ordered partitions).
The sets $p = P^{-1}(a)$, $a \in A$, are the atoms of P and we write $p \in P$. If $(X, \mathcal{B})$
carries a probability measure $\mu$ then dist P (or $\operatorname{dist}_\mu P$ if the measure needs
to be emphasized) denotes the measure $\mu \circ P^{-1}$ on A. If $B \in \mathcal{B}$, $\mu(B) \ne 0$
then $\operatorname{dist}_\mu(P|B)$ refers to the restriction $P|_B$ and the normalized measure
$\mu_B$. We write $P \prec Q$ (Q refines P) if each atom of P is a union of atoms of
Q. For two partitions P and Q indexed by the same alphabet A the distance
$d(P, Q)$, or $d_\mu(P, Q)$, between them is

$$d(P, Q) = \mu\{x \in X \mid P(x) \ne Q(x)\}.$$

For a fixed alphabet A this metric defines a complete metric space of
partitions.

Two partitions of a probability space $(X, \mu)$ are independent if every
atom p of the first one is independent of every atom q of the second one:
$\mu(p \cap q) = \mu(p)\mu(q)$. The definition of joint independence of more than two
partitions is similar. If P and Q are finite partitions of a probability space
$(X, \mu)$ then we will say P is $\delta$-independent of Q whenever

$$\|\operatorname{dist}(P|q) - \operatorname{dist} P\|_\infty < \delta \quad \forall q \in Q.$$

(We are identifying a measure $\sigma$ on a finite set A with the vector $\{\sigma(a)\}_{a \in A}$.)
When P is $\delta$-independent of Q we will write $P \perp_\delta Q$ (or $P \perp_{\delta,\mu} Q$). Note
that this definition of approximate independence differs from a more standard
one which uses the $\ell^1$-norm and allows a small exceptional set of q's. In fact
any reasonable definition of approximate independence could be used in the
statements of our results but the one we have chosen is convenient for the
proofs. We will make use of the fact that if $P \perp_\delta Q$ and $Q \succ R$ then $P \perp_\delta R$.

One more piece of notation: we write $x \overset{\delta}{\sim} y$ whenever x and y are two
elements of a space which are no more than $\delta$ apart with respect to some
metric or norm which is clear from the context.
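
The distance between partitions and the notion of approximate independence defined in this section can be made concrete on a finite probability space. The sketch below is only an illustration under that finiteness assumption: partitions are represented as Python dicts from points to letters, and the function names `distance`, `dist` and `is_delta_independent` are hypothetical. Atoms of measure zero are skipped, since conditioning on them is undefined.

```python
from fractions import Fraction


def distance(mu, P, Q):
    """d(P, Q) = mu{x : P(x) != Q(x)} for partitions given as dicts point -> letter."""
    return sum(mass for x, mass in mu.items() if P[x] != Q[x])


def dist(mu, P, alphabet, given=None):
    """Distribution of P under mu, normalized on the set `given` if one is supplied."""
    points = [x for x in mu if given is None or x in given]
    total = sum(mu[x] for x in points)
    return {a: sum(mu[x] for x in points if P[x] == a) / total for a in alphabet}


def is_delta_independent(mu, P, Q, alphabet_P, alphabet_Q, delta):
    """Check that the sup-norm of dist(P|q) - dist P is < delta for every atom q of Q."""
    base = dist(mu, P, alphabet_P)
    for b in alphabet_Q:
        atom = {x for x in mu if Q[x] == b}
        if sum(mu[x] for x in atom) == 0:
            continue  # conditioning on a null atom is undefined; skip it
        cond = dist(mu, P, alphabet_P, given=atom)
        if max(abs(cond[a] - base[a]) for a in alphabet_P) >= delta:
            return False
    return True


# Four equally likely points; P reads the first bit, Q the second bit.
mu = {x: Fraction(1, 4) for x in range(4)}
P = {x: x // 2 for x in range(4)}
Q = {x: x % 2 for x in range(4)}
```

In this toy space P and Q are exactly independent, so the check succeeds for any positive delta, whereas P is as far as possible from being independent of itself.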

2. Building a measure with prescribed marginals 

This section is static - no dynamics are involved. The object of study
is measures on $A^E$, where A is a finite set (alphabet) and E is countable.
We prove a result on the existence of a probability measure with prescribed
projections on $A^K$ for $K \in \mathcal{F}$, a certain family of finite subsets of E.

We will be working with measures on $A^H$ for various subsets $H \subset E$
and we use the usual Borel structure on $A^H$. Mostly H will be finite.
Whenever $K \subset L \subset E$ we will denote by $P^K$ the projection map
from $A^L$ to $A^K$, which is a finite partition of $A^L$ when K is finite. We will
also write $P^K = \pi_K$. $\pi_K$ also acts on measures: if m is a measure on $A^L$ then
$\pi_K m = m \circ \pi_K^{-1}$. m is then an extension of $\pi_K m$. By definition, if $E \subset A^K$,
then $m(E)$ means the same as $\pi_K m(E)$. Two measures on $A^{K_1}$ and $A^{K_2}$,
$K_1, K_2 \subset E$, are called consistent if their projections on $A^{K_1 \cap K_2}$ coincide.
Consistency of a family of measures means pairwise consistency.

For H finite we will say a probability measure m on $A^H$ is $\delta$-independent
whenever there is an indexing $H = \{h_1, \dots, h_n\}$ such that

$$P^{\{h_i\}} \perp_{\delta,m} P^{\{h_1, \dots, h_{i-1}\}} \quad \forall i = 2, \dots, n.$$

One easily sees that $\delta$-independence of m implies

$$\|m - p_H\|_\infty < (|H| - 1)\delta,$$

where $p_H$ is the product measure on $A^H$ that has the same one-dimensional
marginals as m. Note also that if m is $\delta$-independent then so is $\pi_I m$ for any
$I \subset H$.

Proposition 1. For every $N \in \mathbb{N}$ and $\alpha > 0$ there is a $\delta = \delta(\alpha, N) > 0$ such
that the following assumptions imply the conclusion below.

Assumptions:

A is a finite alphabet, $\mathcal{F}$ is a family of finite subsets of E and for every
set $K \in \mathcal{F}$ a probability measure $\mu_K$ on $A^K$ is given. These data satisfy:

(F) $\forall n \in E$ we have $|\bigcup\{K \in \mathcal{F} : n \in K\}| \le N$,
(M1) The measures $\mu_K$ are pairwise consistent,
(M2) Every atom of each 1-dimensional marginal of each measure $\mu_K$ has
measure $> \alpha$,
(M3) Every measure $\mu_K$ is $\delta$-independent.

Conclusion:

The measures $\{\mu_K\}_{K \in \mathcal{F}}$ have a common extension to $A^E$.

Remark. Evidently there would be no loss of generality in assuming in the
proposition that $\bigcup\mathcal{F} = E$.

Proposition 1 is an immediate consequence of the following claim. 

Claim A. For every $N \in \mathbb{N}$ and $\alpha > 0$ there is a $\delta = \delta(\alpha, N) > 0$ and a
$\beta = \beta(\alpha, N) > 0$ such that the following implication holds. Suppose that the
assumptions of Proposition 1 hold. Suppose further that I is a finite subset of E
and $\lambda$ is a measure on $A^I$ that is consistent with every $\mu_K$ and $\beta$-independent.
Then for $I' = I \cup \{n\}$, $n \in E \setminus I$, there is a measure $\lambda'$ on $A^{I'}$ that extends
$\lambda$, and is again consistent with every $\mu_K$ and $\beta$-independent.

Remark. Clearly $\delta < \beta$. Typically $\delta \ll \beta \ll 1$. Explicitly, one can take
$\beta = \alpha^N/(2N)$, as will be seen from the proof. The expression for $\delta = \delta(\alpha, N)$
can also be developed in principle but we will not need it.

Proof: To prove Claim A we will need two lemmas. These are
Lemmas 2.1 and 2.2 from [BJ], the first of which appeared already in [JRW]
(Proposition 1.2). To make our presentation self-contained, we provide 
sketches of the proofs here and refer the reader to the previous papers for 
more details. 

The first lemma deals with signed measures, for which the notions of 
restriction, extension and consistency remain meaningful. 

Lemma 1. Let K be a finite set and $\{K_i\}$ a finite family of subsets of K
whose union is K. Any consistent family of signed measures on $A^{K_i}$ has a
common extension - a signed measure on $A^K$.

Sketch of the proof. We are given the measures $\mu_{K_i}$, $i = 1, \dots, n$. Pick an
arbitrary probability measure q on $A^K$. For every nonempty $J \subset \{1, \dots, n\}$ denote by
$\mu_J$ the common projection of $\mu_{K_j}$, $j \in J$, onto $A^{\bigcap_{j \in J} K_j}$ multiplied by
$\pi_{K \setminus \bigcap_{j \in J} K_j}\, q$. One common extension is

$$\mu = \sum_{\emptyset \ne J \subset \{1, \dots, n\}} (-1)^{|J|+1} \mu_J.$$

This expression is inspired by the inclusion-exclusion formula in combinatorics
and the proof is based on the same idea - when the expression for $\mu$ is
projected to any of the $A^{K_i}$'s the terms of the sum with $J \ne \{i\}$ can be divided into
pairs that cancel out, leaving $\mu_{K_i}$.

■ 
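
The inclusion-exclusion construction in the sketch above can be tried out numerically. The following Python code is a minimal illustration, not the authors' implementation: measures are stored as dicts from words (tuples of letters) to floats, the names `project` and `common_extension` are hypothetical, and the example data (two measures on overlapping pairs of coordinates, with a uniform auxiliary measure q) are made up for the demonstration. The result is in general only a signed measure, as in Lemma 1.

```python
from itertools import combinations, product


def project(measure, coords, sub):
    """Marginal on A^sub of a (signed) measure on A^coords; sub is a subset of coords."""
    idx = [coords.index(c) for c in sub]
    out = {}
    for word, mass in measure.items():
        key = tuple(word[i] for i in idx)
        out[key] = out.get(key, 0.0) + mass
    return out


def common_extension(alphabet, K, family, q):
    """Inclusion-exclusion extension to A^K of a consistent family [(K_i, mu_i), ...].

    q is an auxiliary probability measure on A^K filling the free coordinates.
    """
    ext = {w: 0.0 for w in product(alphabet, repeat=len(K))}
    for r in range(1, len(family) + 1):
        for J in combinations(range(len(family)), r):
            inter = [c for c in K if all(c in family[j][0] for j in J)]
            rest = [c for c in K if c not in inter]
            K_j, mu_j = family[J[0]]  # by consistency any member of J would do
            mu_inter = project(mu_j, K_j, inter)
            q_rest = project(q, K, rest)
            sign = (-1) ** (r + 1)
            for w in ext:
                w_inter = tuple(w[K.index(c)] for c in inter)
                w_rest = tuple(w[K.index(c)] for c in rest)
                ext[w] += sign * mu_inter.get(w_inter, 0.0) * q_rest.get(w_rest, 0.0)
    return ext


# Made-up example: coordinates 0, 1, 2; mu1 lives on (0, 1), mu2 on (1, 2),
# and both have marginal (0.6, 0.4) on the shared coordinate 1.
K = (0, 1, 2)
mu1 = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.2, (1, 1): 0.3}
mu2 = {(0, 0): 0.5, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.3}
q = {w: 1 / 8 for w in product((0, 1), repeat=3)}
ext = common_extension((0, 1), K, [((0, 1), mu1), ((1, 2), mu2)], q)
```

Projecting `ext` back to the coordinate pairs (0, 1) and (1, 2) recovers `mu1` and `mu2` up to floating-point error, which is the cancellation in pairs described in the sketch of the proof.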

Lemma 2. Suppose V and W are finite-dimensional real normed vector
spaces and $\Pi : V \to W$ is any surjective linear map. Then there is a constant
$C = C(\Pi)$ with the following property: if $v \in V$ and $w = \Pi(v) \ne 0$ then
$\Pi$ has a right inverse B ($\Pi B = \mathrm{id}_W$) such that $Bw = v$ and $\|B\| \le C\|v\|/\|w\|$.

Sketch of the proof. Using the equivalence of any two norms in a
finite-dimensional space it is not hard to see that there is no limitation of generality
in the assumption that V is Euclidean and $\Pi$ is the orthogonal projection
onto its subspace W. In this case the result is immediate (C = 1).



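Continuing with the proof of Claim A, we shall assume, as we may, that
$\bigcup\mathcal{F} = E$. Fix $n \in E \setminus I$, denote $I' = I \cup \{n\}$, $\mathcal{F}_n = \{K \in \mathcal{F} \mid n \in K\}$ and
consider the following two families of subsets of E:

$$\mathcal{S} = \{K \cap I' \mid K \in \mathcal{F}_n\},$$

$$\mathcal{R} = \{K \cap I \mid K \in \mathcal{F}_n\} = \{S \setminus \{n\} \mid S \in \mathcal{S}\}.$$

$\mathcal{S}$ and $\mathcal{R}$ are finite families of sets. All sets in $\mathcal{S}$ contain n. Let

$$\bar S = \bigcup\mathcal{S} \quad \text{and} \quad \bar R = \bigcup\mathcal{R}.$$

Clearly $\bar R = \bar S \setminus \{n\}$, $\bar S \subset I'$, $\bar R \subset I$, and by (F) the cardinality of $\bar S$ does not
exceed N.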

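We are going first to construct a probability measure $\sigma$ on $A^{\bar S}$ - a prospective
marginal of $\lambda'$. It necessarily has to satisfy the three conditions derived
from the corresponding conditions imposed on $\lambda'$ by Claim A, namely

$$\pi_{\bar R}\,\sigma = \pi_{\bar R}\,\lambda, \qquad (1)$$

$$\pi_{K \cap I'}\,\sigma = \pi_{K \cap I'}\,\mu_K \quad \text{for every } K \in \mathcal{F}_n, \qquad (2)$$

$$P^{\{n\}} \perp_{\beta,\sigma} P^{\bar R}. \qquad (3)$$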

Let us assume for the moment that we have already constructed $\sigma$
satisfying (1), (2) and (3). Because of (1) we can then define $\lambda'$ on $A^{I'}$ to be the
relative product measure $\lambda' = \lambda \times_{\bar R} \sigma$, that is

$$\lambda'(x) = \frac{\lambda(\pi_I x)\,\sigma(\pi_{\bar S} x)}{\lambda(\pi_{\bar R} x)} \quad \forall x \in A^{I'}.$$

$\lambda'$ is certainly an extension of $\lambda$ and of $\sigma$. $\lambda'$ is consistent with every $\mu_K$,
$K \in \mathcal{F}$, or equivalently $\lambda'$ is an extension of $\pi_{K \cap I'}\mu_K$, because either $\sigma$ or $\lambda$
is such an extension (depending on whether $n \in K$ or not).
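
The relative product used here is easy to compute on a small example. The sketch below is a hedged illustration with made-up data: two measures on the coordinate sets (0, 1) and (1, 2) sharing the same marginal on coordinate 1, stored as dicts of exact fractions. The function names `marginal` and `relative_product` are hypothetical. It multiplies the two measures and divides by the common marginal, and the result extends both inputs.

```python
from fractions import Fraction
from itertools import product


def marginal(measure, coords, sub):
    """Marginal on A^sub of a measure on A^coords; sub is a subset of coords."""
    idx = [coords.index(c) for c in sub]
    out = {}
    for word, mass in measure.items():
        key = tuple(word[i] for i in idx)
        out[key] = out.get(key, 0) + mass
    return out


def relative_product(lam, I, sigma, S, alphabet):
    """Relative product of lam on A^I and sigma on A^S over their common coordinates."""
    R = tuple(c for c in I if c in S)
    union = tuple(I) + tuple(c for c in S if c not in I)
    lam_R = marginal(lam, I, R)
    out = {}
    for w in product(alphabet, repeat=len(union)):
        at = dict(zip(union, w))
        w_I = tuple(at[c] for c in I)
        w_S = tuple(at[c] for c in S)
        w_R = tuple(at[c] for c in R)
        base = lam_R.get(w_R, 0)
        # Words over a null atom of the common marginal get mass zero.
        out[w] = lam.get(w_I, 0) * sigma.get(w_S, 0) / base if base else Fraction(0)
    return union, out


F = Fraction
I, S = (0, 1), (1, 2)
lam = {(0, 0): F(2, 5), (0, 1): F(1, 10), (1, 0): F(1, 5), (1, 1): F(3, 10)}
sigma = {(0, 0): F(1, 2), (0, 1): F(1, 10), (1, 0): F(1, 10), (1, 1): F(3, 10)}
union, lam_prime = relative_product(lam, I, sigma, S, (0, 1))
```

In this toy setting coordinate 2 plays the role of the new index n, and the shared coordinate 1 plays the role of the set over which the relative product is taken.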

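Moreover the definition of $\lambda'$ together with (3) implies that for all
$X \in P^{\{n\}}$, $Y \in P^{\bar R}$ and $Z \in P^{I \setminus \bar R}$ we have

$$\lambda'(X \mid Y \cap Z) = \sigma(X \mid Y) \overset{\beta}{\sim} \sigma(X) = \lambda'(X).$$

This means that $P^{\{n\}} \perp_{\beta,\lambda'} P^I$, and since $\lambda$ is $\beta$-independent it follows that
$\lambda'$ is also $\beta$-independent, completing the proof of Claim A and Proposition 1.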

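We now proceed to construct $\sigma$. For each $S = K \cap I' \in \mathcal{S}$ and for each
$R = K \cap I \in \mathcal{R}$, where $K \in \mathcal{F}_n$, let

$$\mu_S = \pi_S\,\mu_K \quad \text{and} \quad \nu_R = \pi_R\,\mu_K.$$

By the consistency of the measures $\mu_K$ these requirements unambiguously
define $\mu_S$ and $\nu_R$ even though $K_1 \cap I' = K_2 \cap I'$ does not imply $K_1 = K_2$.
Moreover, both $\{\mu_S\}_{S \in \mathcal{S}}$ and $\{\nu_R\}_{R \in \mathcal{R}}$ are consistent families of measures.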

We let V denote the vector space of all signed measures $\rho$ on $A^{\bar R}$ and W
the space of all consistent families $\{\rho_R\}_{R \in \mathcal{R}}$, where $\rho_R$ is a signed measure
on $A^R$, both spaces endowed with the $\ell^\infty$-norm. We let $\Pi : V \to W$ denote
the projection map, that is $\Pi(\rho) = \{\pi_R\rho\}_{R \in \mathcal{R}}$ for each $\rho \in V$, so we have
$\Pi(\pi_{\bar R}\lambda) = \{\nu_R\}_{R \in \mathcal{R}}$. Lemma 1 tells us that $\Pi$ is surjective so by Lemma 2
we conclude that $\Pi$ has a right inverse B such that

$$B(\{\nu_R\}_{R \in \mathcal{R}}) = \pi_{\bar R}\lambda,$$

and

$$\|B\| \le C\,\|\pi_{\bar R}\lambda\| / \|\{\nu_R\}_{R \in \mathcal{R}}\|,$$

where $C = C(\Pi)$ is a constant depending on $\Pi$. However, $\Pi$ is completely
determined by specifying the sets $R \in \mathcal{R}$, which are subsets of $\bar R$, and by the
condition (F) the cardinality of the latter is less than N, so there are less
than $2^N$ subsets. Therefore there are less than $2^{2^N}$ possibilities for the
projection $\Pi$. This means that we may take C to be a constant depending only
on N. Since $\pi_{\bar R}\lambda$ is a probability measure we have $\|\pi_{\bar R}\lambda\|_\infty \le 1$. Moreover
the hypothesis (M2) ensures that $|A| < \alpha^{-1}$. Since each $\nu_R$ is a probability
measure on $A^R$, a set of cardinality less than $|A|^N$, we have

$$\|\{\nu_R\}_{R \in \mathcal{R}}\|_\infty \ge |A|^{-N} > \alpha^N,$$

so we obtain an absolute bound $\|B\| \le C\alpha^{-N} = C'$.

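Recalling that $\bar R \cup \{n\} = \bar S$, we view each $y \in A^{\bar S}$ as a pair $y = (x, a)$,
$x \in A^{\bar R}$, $a \in A$, and use a similar convention for the sets $A^S$, $S \in \mathcal{S}$. With this
convention, if $\sigma$ is a signed measure on $A^{\bar S}$ and $a \in A$ then $\sigma(\cdot, a)$ is a (signed)
measure on $A^{\bar R}$. Note that the family $\{\mu_S\}_{S \in \mathcal{S}}$ can be rewritten in the form
$\{\mu_{R \cup \{n\}}\}_{R \in \mathcal{R}}$ and $\mu_{R \cup \{n\}}(\cdot, a)$ is a measure on $A^R$ for each $R \in \mathcal{R}$. We now
define $\sigma$ by specifying that

$$\sigma(\cdot, a) = B(\{\mu_{R \cup \{n\}}(\cdot, a)\}_{R \in \mathcal{R}}) \quad \forall a \in A.$$

Note that the consistency of the family $\{\mu_{R \cup \{n\}}(\cdot, a)\}_{R \in \mathcal{R}}$ follows from the
consistency of the family $\{\mu_{R \cup \{n\}}\}_{R \in \mathcal{R}}$. We then have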

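$$\pi_{\bar R}\,\sigma = \sum_{a \in A} \sigma(\cdot, a)$$

$$= \sum_{a \in A} B(\{\mu_{R \cup \{n\}}(\cdot, a)\}_{R \in \mathcal{R}})$$

$$= B\Big(\Big\{\sum_{a \in A} \mu_{R \cup \{n\}}(\cdot, a)\Big\}_{R \in \mathcal{R}}\Big) \quad \text{(since B is linear)}$$

$$= B(\{\nu_R\}_{R \in \mathcal{R}}) = \pi_{\bar R}\lambda,$$

establishing (1). To check (2) observe that for each $a \in A$ and $R \in \mathcal{R}$ we have

$$(\pi_{R \cup \{n\}}\sigma)(\cdot, a) = \pi_R(\sigma(\cdot, a)) = \pi_R B(\{\mu_{Q \cup \{n\}}(\cdot, a)\}_{Q \in \mathcal{R}}) = \mu_{R \cup \{n\}}(\cdot, a),$$

since $\Pi B = \mathrm{id}_W$.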

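It remains to check that $\sigma$ is non-negative and satisfies (3). Note that
all $\nu_R$ and $\mu_{R \cup \{n\}}$, $R \in \mathcal{R}$, are $\delta$-independent. Denote generically by $p_L$ the
product measure of 1-dimensional marginals on $A^{\{l\}}$, $l \in L$. In particular $p_{\{n\}}$
is the common 1-dimensional marginal on $A^{\{n\}}$ of all the measures $\mu_K$, $K \in \mathcal{F}_n$. Now recall
that for $R \in \mathcal{R}$

$$\|\mu_{R \cup \{n\}} - p_{R \cup \{n\}}\|_\infty < (|R \cup \{n\}| - 1)\delta = |R|\delta < N\delta.$$

Similarly $\|\nu_R - p_R\|_\infty < N\delta$. Therefore

$$\mu_{R \cup \{n\}}(x, a) \overset{N\delta}{\sim} p_{\{n\}}(a)\,p_R(x) \overset{N\delta}{\sim} p_{\{n\}}(a)\,\nu_R(x) \quad \forall R \in \mathcal{R},\ x \in A^R,\ a \in A.$$

This means (since we are using $\ell^\infty$-norms) that for all $a \in A$

$$\{\mu_{R \cup \{n\}}(\cdot, a)\}_{R \in \mathcal{R}} \overset{2N\delta}{\sim} p_{\{n\}}(a)\{\nu_R\}_{R \in \mathcal{R}},$$

so using $\|B\| \le C'$ (the absolute bound on $\|B\|$ obtained above) and linearity of B we obtain

$$\sigma(\cdot, a) \overset{2C'N\delta}{\sim} p_{\{n\}}(a)\,B(\{\nu_R\}_{R \in \mathcal{R}}) = p_{\{n\}}(a)\,\pi_{\bar R}\lambda. \qquad (4)$$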
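Since $\pi_{\bar R}\lambda$ is $\beta$-independent and $|\bar R| < N$, for each $x \in A^{\bar R}$ we have

$$\pi_{\bar R}\lambda(x) \overset{N\beta}{\sim} p_{\bar R}(x) > \alpha^N.$$

Taking $\beta = \alpha^N/(2N)$ it follows that $\pi_{\bar R}\lambda(x) > \alpha^N/2$ for all $x \in A^{\bar R}$. Combining
this with (4) and taking $\delta = \alpha^N\beta/(4C'N)$, where $C'$ is the absolute bound on
$\|B\|$ obtained above, we get

$$\Big|\frac{\sigma(x, a)}{\pi_{\bar R}\lambda(x)} - p_{\{n\}}(a)\Big| < \beta \quad \forall x \in A^{\bar R},\ a \in A. \qquad (5)$$

Since $p_{\{n\}}(a) > \alpha$ and $\beta < \alpha/2$ this shows that $\sigma$ takes only positive values
so it is a probability measure. (5) also shows that $P^{\{n\}} \perp_{\beta,\sigma} P^{\bar R}$, establishing
(3) and concluding the proof of Claim A and Proposition 1.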
3. Relatively Independent Iterates of a Partition

If T is an ergodic invertible measure-preserving transformation of X, P is
a partition of X and K is any subset of $\mathbb{Z}$ then the partition $P^K$, indexed by
$A^K$, is defined by

$$[P^K(x)](k) = P(T^k x), \quad \text{for } x \in X,\ k \in K.$$

Informally, $P^K(x) = \{P(T^k x)\}_{k \in K}$, the function P evaluated along the
K-orbit of x. $P^K(x)$ is called the P,K-name of x. Note that the atoms of $P^K$
are the atoms of the common refinement of the partitions $\{T^{-k}P : k \in K\}$.
For this reason it is convenient for us in this section to restate Theorems
1 and 2 in terms of sequences of negative times which are mixing, that is we
replace T with $T^{-1}$. Note also that $\{n_i\}$ is relatively mixing if and only if
$\{-n_i\}$ is relatively mixing.
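
As a small illustration of P,K-names, the Python sketch below computes them for a toy system. Everything in it is an assumption made for the example: the space is the five-point cycle {0, ..., 4} with the rotation x -> x + 1, the partition has letters "a" and "b", and K is restricted to nonnegative integers so that only forward iterates of T are needed (negative times would require the inverse map). The function name `name` is hypothetical.

```python
def name(P, T, x, K):
    """P,K-name of x: the tuple (P(T^k x)) over k in K, K a set of nonnegative integers."""
    letters = []
    for k in sorted(K):
        y = x
        for _ in range(k):  # apply T exactly k times
            y = T(y)
        letters.append(P(y))
    return tuple(letters)


def T(x):
    """Cyclic rotation of the five-point space {0, ..., 4}."""
    return (x + 1) % 5


def P(x):
    """Two-letter partition: letter 'a' on {0, 1}, letter 'b' on {2, 3, 4}."""
    return "a" if x < 2 else "b"
```

Two points receive the same name exactly when they lie in the same atom of the common refinement of the partitions pulled back along the times in K, which matches the remark about the atoms of the name partition.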

We will say that P is independent over K if the partitions $\{T^{-k}P,\ k \in K\}$
are jointly independent, equivalently $\operatorname{dist} P^K$ is the product of its
one-dimensional marginals. We will use the same terminology in the relative
setting, that is with respect to the fiber measures $\mu_\omega$. Similar observations
apply to approximate independence, for example $P^{\{l\}}$ is $\delta$-independent of $P^K$
if and only if $T^{-l}P$ is $\delta$-independent of $\bigvee_{k \in K} T^{-k}P$.

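Theorem 1. Let $(X, \mathcal{B}, \mu, T)$ be an ergodic system with a factor $(\Omega, \mathcal{F}, \mathbb{P}, \theta)$
where $\mu$ decomposes as $\mu = \int \mu_\omega \, d\mathbb{P}$. Assume $\{-n_i\}_{i=0}^{\infty}$ is an $\Omega$-mixing
sequence for T and P is a finite partition of X for which there is an $\alpha > 0$ such
that

for $\mathbb{P}$-a.e. $\omega \in \Omega$, $\forall p \in P$ we have $\mu_\omega(p) > \alpha$.

Then for every $\varepsilon > 0$ there exist a partition Q of X, a set $E \subset \Omega$, $\mathbb{P}(E) < \varepsilon$,
and a subsequence $\{m_i\}$ of $\{n_i\}$, such that

$$\operatorname{dist}_{\mu_\omega} Q = \operatorname{dist}_{\mu_\omega} P \quad \text{for } \mathbb{P}\text{-a.e. } \omega, \qquad (1)$$

$$d_{\mu_\omega}(P, Q) < \varepsilon \quad \text{for } \mathbb{P}\text{-a.e. } \omega \qquad (2)$$

and

Q is independent over $\{m_i\}$ with respect to $\mu_\omega$ for all $\omega \notin E$. (3)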

Proof: There are two cases: $\theta$ has a set of periodic points of positive
measure, and $\theta$ is aperiodic. In the first case, by the ergodicity of $\theta$ it must be
a rotation on a finite number k of points. By dropping to a sub-sequence we
may assume that the $n_i$ are all congruent modulo k, say $n_i = n'_i + r$, where the
$n'_i$ are multiples of k. The independence we are aiming for over a subsequence
$n_{i_j}$ (which is now on every fiber) is then equivalent to independence over
$n'_{i_j}$, so we may as well assume that the $n_i$ are themselves multiples of k, say
$n_i = q_i k$. For each $\omega$, $T^k$ preserves the measure $\mu_\omega$ and $\{q_i\}$ is mixing for
$(T^k, \mu_\omega)$. Fix any $\omega_0$ and apply the non-relative version of Theorem 1 ([BJ]
Theorem 2, also [JRW] Theorem 5) to $(T^k, \mu_{\omega_0})$ to modify P by a small
amount on the fiber over $\omega_0$, obtaining a new partition and a subsequence of
$\{q_i\}$ which give the desired independence with respect to $\mu_{\omega_0}$. Now repeat
the process in the fiber over $\theta\omega_0$ with the new partition and subsequence to
obtain the desired independence with respect to $\mu_{\theta\omega_0}$. Iterating the process
until we get to $\theta^{k-1}\omega_0$ concludes the argument in the periodic case.

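In the aperiodic case we will proceed inductively starting with $Q_0 = P$,
$m_0 = 0$, $E_0 = \emptyset$ and building the sequences $\{Q_j\}$, $\{m_j\}$ and $\{E_j\}$ so that for
all $j \ge 1$, $\mathbb{P}(E_j) < \varepsilon/2^j$ and

(1') $\operatorname{dist}_{\mu_\omega} Q_j = \operatorname{dist}_{\mu_\omega} Q_{j-1}$ for $\mathbb{P}$-a.e. $\omega$,
(2') $d_{\mu_\omega}(Q_j, Q_{j-1}) < \varepsilon/2^j$ for $\mathbb{P}$-a.e. $\omega$,

(3') $Q_j$ is jointly independent over $\{m_i\}_{i=1}^{j}$ with respect to $\mu_\omega$ for all $\omega \notin \bar E_j$,
where $\bar E_j := \bar E_{j-1} \cup E_j = \bigcup_{i=0}^{j} E_i$.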

Assume for a moment that this construction has been carried out. Then
defining

$$E = \bigcup_{j=1}^{\infty} E_j = \bigcup_{j=1}^{\infty} \bar E_j$$

(so $\mathbb{P}(E) < \varepsilon$) and $Q = \lim Q_j$ we get (1), (2) and (3) satisfied. The limit is
taken in the symmetric difference metric in the space of partitions. The space
is complete, and the sequence of partitions is Cauchy by (2'). Therefore we
are done.

The proof of the theorem is thereby reduced to the induction step. For
convenience we now get rid of unnecessary subscripts and constants and
restate in equivalent form what remains to be proved. Use the assumptions
of the theorem and let K be a subset of $\mathbb{N} \cup \{0\}$ containing 0 so that P is
independent over K with respect to $\mu_\omega$ for all $\omega$ outside some $E \subset \Omega$.

Given $\varepsilon > 0$ we are looking for $m \in \{n_i\}$, $m > \max K$, a partition Q of
X and $E' \subset \Omega$, $\mathbb{P}(E') < \varepsilon$, such that

(1") $\operatorname{dist}_{\mu_\omega} Q = \operatorname{dist}_{\mu_\omega} P$ for $\mathbb{P}$-a.e. $\omega$,
(2") $d_{\mu_\omega}(P, Q) < \varepsilon$ for $\mathbb{P}$-a.e. $\omega$,

(3") Q is independent over $K \cup \{m\}$ with respect to $\mu_\omega$ for all $\omega \notin E \cup E'$.

Denote $|K| = k$ and let $\eta = \frac{1}{20}\alpha^{k+1}\delta\varepsilon$ where $\delta = \delta(\alpha, k^2 + 1)$ from
Proposition 1 (in Section 2) - this choice of $\eta$ will become clear later on. By
the hypothesis, we know that, for $\omega \in \Omega \setminus E$, P is independent over K with
respect to $\mu_\omega$, in other words $\operatorname{dist}_{\mu_\omega} P^K$ is the product of its one-dimensional
marginals. Using the mixing property of the sequence $\{-n_i\}$ pick $m \in \{n_i\}$
such that $m > \max K$ and for $\omega \notin E_1$, $\mathbb{P}(E_1) < \dots$,

$$P^{\{m\}} \perp_{\eta,\mu_\omega} P^K.$$

Let $B_\Omega \subset \Omega$ be the base of a Rokhlin tower R for $\theta$ of height M where
$M > 10m/\varepsilon$, with error set

$$E_2 = \Omega \setminus \bigcup\{\theta^i B_\Omega : 0 \le i < M\}$$

of measure less than $\dots$. The tower R can be lifted to X by $\pi^{-1}$, where $\pi$
is the factorization map. This tower in X can informally be regarded as an
"integral of towers" with bases $B_\omega = \pi^{-1}\{\omega\}$, $\omega \in B_\Omega$, and this observation
inspires the construction that follows. Let $\mathbf{M} = [0, M - 1]$. With $\omega \in B_\Omega$
fixed, its $\mathbf{M}$-orbit $\{\theta^i\omega\}_{i=0}^{M-1}$ gives rise to the T-tower $R_\omega$ whose levels are the
fibers $\pi^{-1}(\theta^i\omega)$ endowed with measures $\mu_{\theta^i\omega}$.

Let $\nu_\omega = \operatorname{dist}_{\mu_\omega} P^{\mathbf{M}}$, a probability measure on $A^{\mathbf{M}}$. Denote $K' = K \cup \{m\}$.
For all j that satisfy $j + K' \subset \mathbf{M}$ let $\nu_{\omega,j} = \pi_{j+K'}\nu_\omega$. For those values
of j for which $\theta^j\omega \notin E \cup E_1$ the measure $\nu_{\omega,j} = \operatorname{dist}_{\mu_\omega} P^{j+K'}$ on $A^{j+K'}$ is $\eta$-independent,
by our choice of m and because its marginal on $A^{j+K}$ is exactly independent.
We now aim at perturbing P to obtain Q that satisfies (1") and (2") and for
which $\operatorname{dist}_{\mu_\omega} Q^{j+K'}$ is exactly independent for all combinations of $\omega$ and j
corresponding to $\theta^j\omega$ outside of a set slightly larger than $E \cup E_1$.

Fix an $\omega \in B_\Omega$ and consider the tower $R_\omega$. Denote its base $\pi^{-1}\{\omega\}$ by
B. Partition B into two parts $B = B_0 \cup B_1$, $\mu_\omega(B_0) = \varepsilon/10$, independently of
$\mathbf{M}$-names of the points in B (i.e. after proper normalization $\operatorname{dist}_{\mu_\omega} P^{\mathbf{M}}$ on
each of $B_0$ and $B_1$ is identical to that on B). This can be done since the fibers
of $\pi$ are non-atomic by the mixing property of T.

We will construct the required partition Q by re-assigning points on the
$\mathbf{M}$-tower above $B_0$ to the letters of the alphabet A (the technique known as
"painting names on towers"), while retaining this assignment on the $\mathbf{M}$-tower
above $B_1$. We now explain how the re-assignment is done. (Recall that $\omega$ is
fixed for the time being.)

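Let $m_{\omega,j}$ denote the projection of $\nu_\omega$ onto $A^{\{j\}}$, a probability measure on
$A^{\{j\}}$ which is the same as $\operatorname{dist}_{\mu_{\theta^j\omega}} P$ (up to identification of $A^{\{j\}}$ with A).
Of course the one-dimensional marginal of $\nu_{\omega,j}$ on $A^{\{j+k\}}$ for any particular
k is $m_{\omega,j+k}$. For all j satisfying

$$0 \le j < M - m \quad \text{and} \quad \theta^j\omega \notin E \cup E_1 \qquad (*)$$

we define a "correcting measure" $\xi_{\omega,j}$ on $A^{j+K'}$ by the following equation:

$$(1 - (\varepsilon/10))\,\nu_{\omega,j} + (\varepsilon/10)\,\xi_{\omega,j} = \prod_{k \in K'} m_{\omega,j+k}.$$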

The idea is that blending a small fraction of $\xi_{\omega,j}$ into $\nu_{\omega,j}$ corrects it from
approximate to exact independence. A priori $\xi_{\omega,j}$ is a signed measure but we
shall see shortly that it is in fact positive. Observe that for any $k \in K'$ the
marginal of $\xi_{\omega,j}$ on $A^{\{j+k\}}$ is $m_{\omega,j+k}$, since that is the case for both $\nu_{\omega,j}$ and
$\prod_{k \in K'} m_{\omega,j+k}$.

The definition of $\xi_{\omega,j}$ is equivalent to

$$\xi_{\omega,j} - \prod_{k \in K'} m_{\omega,j+k} = ((10/\varepsilon) - 1)\Big(\prod_{k \in K'} m_{\omega,j+k} - \nu_{\omega,j}\Big). \qquad (4)$$

By our choice of m the $\ell^\infty$-norm of the right hand side of (4) is less than

$$\frac{10}{\varepsilon}\,\eta = \delta\,\frac{\alpha^{k+1}}{2} < \frac{\alpha^{k+1}}{2}.$$

Since the minimal value of $\prod_{k \in K'} m_{\omega,j+k}$ on singletons in $A^{j+K'}$ is at least
$\alpha^{k+1}$ we conclude that $\xi_{\omega,j}$ is indeed positive.

Projecting the relation between the measures from $A^{j+K'}$ to $A^{j+K}$ we
conclude that $\pi_{j+K}\xi_{\omega,j}$ is the product of its one-dimensional marginals
$m_{\omega,j+k}$, since this is true for $\pi_{j+K}\nu_{\omega,j}$. In addition, we claim that for all j
satisfying (*) the measure $\xi_{\omega,j}$ is $\delta$-independent. In view of the mutual
independence of the first k (out of the total of k + 1) one-dimensional marginals
of this measure it would be enough to prove that for all $x \in A^{j+K}$ and $a \in A$
we have

$$\Big|\frac{\xi_{\omega,j}(x, a)}{\pi_{j+K}\xi_{\omega,j}(x)} - \pi_{\{j+m\}}\xi_{\omega,j}(a)\Big| < \delta.$$

(As in Section 2 we view $y \in A^{j+K'}$ as a pair $y = (x, a)$, $x \in A^{j+K}$,
$a \in A^{\{j+m\}}$.) This is achieved by our choice of $\eta$, as the following calculation
shows. Evaluate (4) at (x, a) and then divide by

$$\pi_{j+K}\xi_{\omega,j}(x) = \pi_{j+K}\nu_{\omega,j}(x)$$

to obtain

$$\Big|\frac{\xi_{\omega,j}(x, a)}{\pi_{j+K}\xi_{\omega,j}(x)} - \pi_{\{j+m\}}\xi_{\omega,j}(a)\Big| = \Big(\frac{10}{\varepsilon} - 1\Big)\Big|\pi_{\{j+m\}}\nu_{\omega,j}(a) - \frac{\nu_{\omega,j}(x, a)}{\pi_{j+K}\nu_{\omega,j}(x)}\Big| < \frac{10}{\varepsilon} \cdot \frac{1}{20}\,\alpha^{k+1}\delta\varepsilon < \delta.$$
Proposition 1 of Section 2 can now be applied to the family $\mathcal{T}$ of those 
shifts $j + K'$ of $K'$ for which $j$ satisfies (*), endowed by the measures $\xi_{\omega,j}$. (F) 
holds for $N = k^2 + 1$. (M1), the consistency condition for the measures $\xi_{\omega,j}$, 
is implied by the consistency of $\nu_{\omega,j}$. (M2) holds because the 1-dimensional 
marginals of $\xi_{\omega,j}$ are the same as for $\nu_{\omega,j}$. (M3) has just been demonstrated. 
So there is a measure $\xi_\omega$ on $A^M$ that is a common extension of all the measures 
in the family. Write the names on the levels of the T-tower above $B$ as 
prescribed by $\xi_\omega$. Explicitly what this means is we choose a partition $R$ of $B_0$ 
indexed by $A^M$ so that $\mathrm{dist}(R|B) = \xi_\omega$ and then we partition the tower over 
$B$ according to the rule: for each $x \in B$ and $j \in M$ let $Q(T^j x) = (R(x))(j)$. 
To complete the construction of the partition $Q$ on the whole M-tower above 
$B$ recall that this partition coincides with $P$ above $B_1$. 
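The rule $Q(T^j x) = (R(x))(j)$ can be illustrated on a finite model. The sketch below is a toy under explicit assumptions: the base is a finite set of labelled points, tower levels are pairs `(x, j)`, and `paint_tower` and `name_of` are hypothetical helper names chosen here, not the paper's notation. Each base point carries a word of length equal to the tower height, and level $j$ above it receives the $j$-th letter of that word.

```python
def paint_tower(base_names):
    """Assign letters on tower levels: Q[(x, j)] is letter j of the word R(x)."""
    return {
        (x, j): word[j]
        for x, word in base_names.items()
        for j in range(len(word))
    }


def name_of(q, x, height):
    """Read the Q-name of base point x back off levels 0..height-1."""
    return tuple(q[(x, j)] for j in range(height))


# Toy data (assumed for illustration): two base points, tower height 3.
base_names = {"x0": ("a", "b", "a"), "x1": ("b", "b", "a")}
q = paint_tower(base_names)
```

Reading names back with `name_of` returns exactly the words that were painted, so the distribution of names on the base is the one prescribed by the partition of the base.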

The procedure we described depends on the point $\omega \in B_0$. Performing it 
for every $\omega \in B_0$ we get a partition $Q$ of the whole $\pi^{-1}(\Omega \setminus E_2)$. On $\pi^{-1}(E_2)$ 
just declare $Q := P$. One only needs to assure that $Q$ can be chosen 
$\mu$-measurable. This follows from the fact that for any measurable $A \subset X$ and 
$0 < t < 1$ there is a set $B \subset X$ such that $\mu_\omega(B) = t$ for a.a. $\omega$ and $A$ and 
$B$ are independent with respect to $\mu_\omega$ for a.a. $\omega$. Using this and following 
every step of the procedure it is easy to convince oneself that everything can 
be done in a $\mu$-measurable way. 

Having done all this, we obtain a partition $Q$ of $X$ that clearly satisfies 
(1") and (2"). (1") holds because the 1-dimensional marginals of the measures 
$\nu_{\omega,j}$ and $\xi_{\omega,j}$ are the same. (2") is satisfied even with $\varepsilon/10$ instead of $\varepsilon$. By 
virtue of the construction, (3") is satisfied for all $\omega$ outside $E \cup E_1 \cup E_2 \cup E_3$. 
$E_1$ and $E_2$ have been defined above, while $E_3$ is the set of $m$ uppermost levels 
in the M-tower for $\theta$ with base $B$. So the measure of $E_3$ is at most $m/M < \varepsilon/10$. Therefore 
taking $E = E_1 \cup E_2 \cup E_3$ we obtain the required version of (3"). 



Remark. The proof shows that, as in [JRW] and [JB], the independence can 
be achieved not just along the sequence $\{m_i\}$, but also along the IP-set which 
it generates, namely the set of all finite sums of the form $\sum_{i \in F} m_i$, where $F$ 
is a finite subset of $\mathbb{N}$. 
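For a finite initial segment of the sequence, the IP-set it generates can be listed directly. This is a minimal sketch assuming that only nonempty index sets $F$ are used; the function name `ip_set` is a hypothetical choice for illustration.

```python
from itertools import combinations


def ip_set(seq):
    """All sums over nonempty finite subsets of the (finite) sequence seq, sorted."""
    sums = set()
    for r in range(1, len(seq) + 1):
        for subset in combinations(seq, r):
            sums.add(sum(subset))
    return sorted(sums)


# Example: the first three terms of a hypothetical sequence m_1, m_2, m_3.
print(ip_set([1, 3, 9]))  # [1, 3, 4, 9, 10, 12, 13]
```

The number of sums grows at most like $2^n - 1$ for $n$ terms, so this brute-force listing is only meant for small examples.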

Theorem 2. With the same hypotheses as in Theorem 1 there is a $Q$ and a 
subsequence $\{m_i\}$ such that $\mathrm{dist}_\omega Q = \mathrm{dist}_\omega P$ for a.a. $\omega$, $d(P,Q) < \varepsilon$ and 
$Q$ is independent over $\{m_i\}$ with respect to $\mu_\omega$ for almost every $\omega$. 

The proof is a modification of the proof of Theorem 1, using the following 
lemma. 

Lemma 3. Suppose $K$ is a finite subset of $\mathbb{N}$ and $P$ is a partition which is 
independent over $K$ with respect to $\mu_\omega$ for all $\omega \notin E$ where $E$ is some subset 
of $\Omega$. Then for any $\varepsilon > 0$ there is a partition $Q$ such that $P$ and $Q$ agree on 
all fibers not in $KE = \bigcup\{T^k E : k \in K\}$, $\mathrm{dist}_\omega P = \mathrm{dist}_\omega Q$ for all $\omega$ and 
there is a subset $F \subset \Omega$ of measure less than $\varepsilon$ such that $Q$ is independent over $K$ 
with respect to $\mu_\omega$ for all $\omega \notin F$. 

Proof: Let $B \subset \Omega$ be the base of a Rohlin tower of height $n \gg k = 
\max K$. Fix $\omega_0 \in B$. As in the proof of Theorem 1 we will describe how to 
change $P$ on the tower over $\pi^{-1}\{\omega_0\}$, with the understanding that this should 
be done simultaneously for each $\omega \in B$, in a measurably varying way. 

Fix any $i_1$ such that $0 \le i_1 < n - k$ and $\theta^{i_1}\omega_0 \in E$. Now change $P$ on each 
$\pi^{-1}(\theta^{i_1+k}\omega_0)$, $k \in K$, retaining the same distribution on each of these fibers, 
to obtain a new partition $P_1$ so that, with respect to $\mu_{\omega_0}$, $P_1$ is independent 
over $i_1 + K$ and in addition $P_1^{i_1+K}$ is independent of $P_1^{[0,n) \setminus (i_1+K)}$. This is 
possible simply because the fibers are non-atomic. 

Of course we will then have that $P_1$ is independent over $K$ with respect 
to $\mu_{\theta^{i_1}\omega_0}$. In addition, for any $i \ne i_1$ such that $0 \le i < n - k$ and $\theta^i\omega_0 \notin E$, we 
will still have that $P_1$ is independent over $K$ with respect to $\mu_{\theta^i\omega_0}$. Indeed, 
with respect to $\mu_{\omega_0}$, $P_1$ is independent over $S_1 = (i + K) \cap (i_1 + K)^c$, since $P$ 
has this property while $P_1$ agrees with $P$ over the relevant fibers. Also with 
respect to $\mu_{\omega_0}$, $P_1$ is independent over $S_2 = (i + K) \cap (i_1 + K)$, since $P_1$ is 
actually independent over all of $i_1 + K$ by construction. Finally $P_1^{S_1}$ and $P_1^{S_2}$ 
are independent with respect to $\mu_{\omega_0}$, again by construction of $P_1$. This shows 
that $P_1$ is independent over $i + K$ with respect to $\mu_{\omega_0}$, which implies that $P_1$ 
is independent over $K$ with respect to $\mu_{\theta^i\omega_0}$. 

Now replace $i_1$ with any $i_2$, $0 \le i_2 < n - k$, such that we still do not 
have the desired independence with respect to $\mu_{\theta^{i_2}\omega_0}$ and modify $P_1$ to $P_2$ 
as before to achieve the independence with respect to $\mu_{\theta^{i_2}\omega_0}$. Continuing in 
this way we will arrive at a partition $Q$ which has the desired independence 
with respect to $\mu_{\theta^i\omega_0}$ for every $i$, $0 \le i < n - k$. Note also that $Q$ differs 
from $P$ only on fibers over $KE$. Doing this for all $\omega_0 \in B$ we obtain the 
desired independence with respect to $\mu_\omega$ for all $\omega \in \theta^i B$, $0 \le i < n - k$, and 
the measure of the complement of this set of $\omega$'s can be made as small as we please. 

■ 

Corollary 1. Lemma 3 holds with $\varepsilon = 0$. 

Proof: Iterating Lemma 3 with a summable sequence of $\varepsilon$'s produces a 
sequence of partitions which, by the Borel-Cantelli lemma, eventually stabilizes 
over $\omega$ for a.a. $\omega$. This gives a limiting partition $Q$ with the desired properties. 

■ 

Now it is easy to prove Theorem 2: in the proof of Theorem 1 at each stage, 
after achieving the independence of $P_k$ over $\{m_1, \dots, m_k\}$ with respect to $\mu_\omega$ 
for a large set of $\omega$, use Corollary 1 to change $P_k$ over a small set of $\omega$ to 
achieve the desired independence over a.a. $\omega$. This concludes the proof of 
Theorem 2. 



We now sketch how to remove the ergodicity assumption in Theorems 1 
and 2. Let us first observe that it is enough to prove the result in the case 
when $\theta$ is periodic with period $n \in \{1, 2, \dots, \infty\}$. Indeed, supposing that we 
have done this, partition $\Omega$ into the invariant sets $\Omega_n$, $n = 1, 2, \dots, \infty$, where 
$\theta$ has period exactly $n$. We start in $\Omega_\infty$ and find a $Q_\infty$ uniformly close to 
$P$ over $\Omega_\infty$ and equal to $P$ over $\Omega \setminus \Omega_\infty$, and a first subsequence $\{m_i\}$ which 
gives the desired independence in the fibers over most of $\Omega_\infty$. Then we work 
over $\Omega_1$ and find $Q_1$ close to $Q_\infty$ and a further subsequence which gives the 
desired independence on most of $\Omega_1$. We continue in this way, perturbing the 
partition and refining the subsequence, for finitely many steps until we have 
exhausted all but a small invariant set $E \subset \Omega$ consisting of some tail of the 
sequence $\{\Omega_n\}$. For Theorem 1 we only need independence on all but a small 
subset of $\Omega$ so we are done. In the case of Theorem 2, if we are willing to relax 
the requirement that the perturbed partition have the same distribution as 
the original one on all fibers, we can get the desired independence on fibers 
over $E$ by simply making the partition trivial on those fibers. 
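The decomposition of the base into sets of constant period is easy to visualize for a permutation of a finite set. The following sketch is only a finite toy model: `theta` is assumed to be a bijection given as a dictionary, and `period_classes` is a hypothetical helper name. In a finite model every point is periodic, so the aperiodic part $\Omega_\infty$ has no counterpart here.

```python
def period_classes(theta):
    """Group the points of a finite set by their exact period under the bijection theta."""
    classes = {}
    for x in theta:
        y, n = theta[x], 1
        # Follow the orbit of x until it returns; terminates because theta is a bijection.
        while y != x:
            y, n = theta[y], n + 1
        classes.setdefault(n, set()).add(x)
    return classes


# Toy permutation (assumed for illustration): a fixed point, a 2-cycle and a 3-cycle.
theta = {0: 0, 1: 2, 2: 1, 3: 4, 4: 5, 5: 3}
print(period_classes(theta))  # {1: {0}, 2: {1, 2}, 3: {3, 4, 5}}
```

Each class is invariant under `theta`, which mirrors the invariance of the sets $\Omega_n$ used in the reduction.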

So, we may now assume that $\theta$ has constant period $k$, possibly infinite. 
In both Theorems 1 and 2 the case $k = \infty$ is identical to the ergodic case since 
aperiodicity is all that is needed for the Rohlin lemma. In the case when 
$k$ is finite, for Theorem 1 the same congruence trick used in the ergodic case 
reduces us to working with $T^k$ so we may assume that $\theta = \mathrm{id}_\Omega$. So we have 
reduced ourselves to proving the non-ergodic versions of Theorems 1 and 2 in 
the case when $\theta$ is the identity. In the case of Theorem 1 this is just a slightly 
souped up version of the absolute theorem where instead of having just one 
weakly mixing system one has a whole measurable field of them. It is quite 
straightforward to prove this by suitably modifying the proof of the absolute 
result. To actually carry it out would require us to delve into the proof of 
the absolute result. This is the reason that we chose to prove Theorems 1 and 
2 formally only in the ergodic case. In the case of Theorem 2 we again just 
make the partition trivial in the small set of fibers where Theorem 1 does not 
give independence. 

The observant reader will have noticed that Theorems 1 and 2 do not 
quite generalize Krengel's theorem in that we don't insist that the subsequence 
$\{m_i\}$ start with $m_1 = 0$. The place where our argument would break down if 
we wanted $m_1 = 0$ is at the congruence trick used to deal with the periodic 
case. The results may well be true with $m_1 = 0$ and in any case our argument 
shows that they do hold at least when $\theta$ is aperiodic. 